Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 2191 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 205.5 KiB |
| Average record size in memory | 96.1 B |
Variable types
| DateTime | 1 |
|---|---|
| Categorical | 1 |
| Numeric | 10 |
PM_RETIRO is highly correlated with PM_CIUDADLINEAL | High correlation |
PM_CIUDADLINEAL is highly correlated with PM_RETIRO | High correlation |
DEW_POINT is highly correlated with TEMPERATURE | High correlation |
TEMPERATURE is highly correlated with DEW_POINT | High correlation |
COMMULATIVE_PRECIPITATION is highly skewed (γ1 = 46.80810625) | Skewed |
FECHA has unique values | Unique |
PM_RETIRO has 1115 (50.9%) zeros | Zeros |
PM_VALLECAS has 1250 (57.1%) zeros | Zeros |
PM_CIUDADLINEAL has 1118 (51.0%) zeros | Zeros |
PM_CENTRO has 36 (1.6%) zeros | Zeros |
COMMULATIVE_PRECIPITATION has 1750 (79.9%) zeros | Zeros |
Reproduction
| Analysis started | 2021-05-04 15:10:53.360340 |
|---|---|
| Analysis finished | 2021-05-04 15:11:15.887157 |
| Duration | 22.53 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
| Distinct | 2191 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.2 KiB |
| Minimum | 2010-01-01 00:00:00 |
|---|---|
| Maximum | 2015-12-31 00:00:00 |
Histogram with fixed size bins (bins=50)
SEASON
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.2 KiB |
| 2 | |
|---|---|
| 1 | |
| 3 | |
| 4 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2191 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4 |
|---|---|
| 2nd row | 4 |
| 3rd row | 4 |
| 4th row | 4 |
| 5th row | 4 |
| Value | Count | Frequency (%) |
| 2 | 552 | |
| 1 | 552 | |
| 3 | 546 | |
| 4 | 541 |
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| 1 | 552 | |
| 2 | 552 | |
| 3 | 546 | |
| 4 | 541 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 552 | |
| 2 | 552 | |
| 3 | 546 | |
| 4 | 541 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2191 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 1 | 552 | |
| 2 | 552 | |
| 3 | 546 | |
| 4 | 541 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2191 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 1 | 552 | |
| 2 | 552 | |
| 3 | 546 | |
| 4 | 541 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2191 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 1 | 552 | |
| 2 | 552 | |
| 3 | 546 | |
| 4 | 541 |
| Distinct | 998 |
|---|---|
| Distinct (%) | 45.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43.92409122 |
|---|---|
| Minimum | 0 |
| Maximum | 564.7083333 |
| Zeros | 1115 |
| Zeros (%) | 50.9% |
| Memory size | 17.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 66.45833333 |
| 95-th percentile | 183.0528846 |
| Maximum | 564.7083333 |
| Range | 564.7083333 |
| Interquartile range (IQR) | 66.45833333 |
Descriptive statistics
| Standard deviation | 68.72708376 |
|---|---|
| Coefficient of variation (CV) | 1.564678559 |
| Kurtosis | 6.430955004 |
| Mean | 43.92409122 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.243863554 |
| Sum | 96237.68387 |
| Variance | 4723.412042 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 1115 | |
| 31.875 | 4 | 0.2% |
| 14.375 | 3 | 0.1% |
| 26.56521739 | 3 | 0.1% |
| 15.79166667 | 3 | 0.1% |
| 18.83333333 | 3 | 0.1% |
| 43.91666667 | 3 | 0.1% |
| 29.625 | 2 | 0.1% |
| 64.70833333 | 2 | 0.1% |
| 24.95833333 | 2 | 0.1% |
| Other values (988) | 1051 |
| Value | Count | Frequency (%) |
| 0 | 1115 | |
| 3 | 1 | < 0.1% |
| 3.291666667 | 1 | < 0.1% |
| 3.541666667 | 1 | < 0.1% |
| 3.545454545 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 564.7083333 | 1 | |
| 488.9166667 | 1 | |
| 439.9166667 | 1 | |
| 416.6666667 | 1 | |
| 399.7083333 | 1 |
| Distinct | 881 |
|---|---|
| Distinct (%) | 40.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.66228327 |
|---|---|
| Minimum | 0 |
| Maximum | 593 |
| Zeros | 1250 |
| Zeros (%) | 57.1% |
| Memory size | 17.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 61.4682971 |
| 95-th percentile | 179.9089027 |
| Maximum | 593 |
| Range | 593 |
| Interquartile range (IQR) | 61.4682971 |
Descriptive statistics
| Standard deviation | 67.35235951 |
|---|---|
| Coefficient of variation (CV) | 1.698146298 |
| Kurtosis | 8.38567089 |
| Mean | 39.66228327 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.495710493 |
| Sum | 86900.06264 |
| Variance | 4536.340331 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 1250 | |
| 101.0416667 | 3 | 0.1% |
| 112 | 3 | 0.1% |
| 45.29166667 | 3 | 0.1% |
| 55 | 3 | 0.1% |
| 24.41666667 | 2 | 0.1% |
| 116.7083333 | 2 | 0.1% |
| 77.08333333 | 2 | 0.1% |
| 22.375 | 2 | 0.1% |
| 63.875 | 2 | 0.1% |
| Other values (871) | 919 |
| Value | Count | Frequency (%) |
| 0 | 1250 | |
| 4.733333333 | 1 | < 0.1% |
| 4.9 | 1 | < 0.1% |
| 4.958333333 | 1 | < 0.1% |
| 5.782608696 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 593 | 1 | |
| 518.047619 | 1 | |
| 442.625 | 1 | |
| 439.4583333 | 1 | |
| 397.5833333 | 1 |
| Distinct | 1003 |
|---|---|
| Distinct (%) | 45.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43.54700502 |
|---|---|
| Minimum | 0 |
| Maximum | 510.0434783 |
| Zeros | 1118 |
| Zeros (%) | 51.0% |
| Memory size | 17.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 64.91666667 |
| 95-th percentile | 185.1958333 |
| Maximum | 510.0434783 |
| Range | 510.0434783 |
| Interquartile range (IQR) | 64.91666667 |
Descriptive statistics
| Standard deviation | 69.05689528 |
|---|---|
| Coefficient of variation (CV) | 1.585801256 |
| Kurtosis | 6.382821052 |
| Mean | 43.54700502 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.288068084 |
| Sum | 95411.488 |
| Variance | 4768.854785 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 1118 | |
| 71.58333333 | 3 | 0.1% |
| 33.54166667 | 3 | 0.1% |
| 23.66666667 | 3 | 0.1% |
| 37.5 | 2 | 0.1% |
| 23.58333333 | 2 | 0.1% |
| 60.625 | 2 | 0.1% |
| 96.45833333 | 2 | 0.1% |
| 57.70833333 | 2 | 0.1% |
| 23.41666667 | 2 | 0.1% |
| Other values (993) | 1052 |
| Value | Count | Frequency (%) |
| 0 | 1118 | |
| 3.842105263 | 1 | < 0.1% |
| 4.857142857 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 5.833333333 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 510.0434783 | 1 | |
| 466.7916667 | 1 | |
| 423.2916667 | 1 | |
| 417.4666667 | 1 | |
| 415.8333333 | 1 |
| Distinct | 1801 |
|---|---|
| Distinct (%) | 82.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 94.3325817 |
|---|---|
| Minimum | 0 |
| Maximum | 568.5652174 |
| Zeros | 36 |
| Zeros (%) | 1.6% |
| Memory size | 17.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 13.70833333 |
| Q1 | 37.5327381 |
| median | 73.375 |
| Q3 | 124.6666667 |
| 95-th percentile | 250.3125 |
| Maximum | 568.5652174 |
| Range | 568.5652174 |
| Interquartile range (IQR) | 87.13392857 |
Descriptive statistics
| Standard deviation | 78.00042362 |
|---|---|
| Coefficient of variation (CV) | 0.8268662027 |
| Kurtosis | 3.759383961 |
| Mean | 94.3325817 |
| Median Absolute Deviation (MAD) | 41.5 |
| Skewness | 1.680332444 |
| Sum | 206682.6865 |
| Variance | 6084.066085 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 36 | 1.6% |
| 43.54166667 | 4 | 0.2% |
| 22.04166667 | 4 | 0.2% |
| 25.08333333 | 4 | 0.2% |
| 14.41666667 | 3 | 0.1% |
| 67.45833333 | 3 | 0.1% |
| 43.41666667 | 3 | 0.1% |
| 25.625 | 3 | 0.1% |
| 122.2916667 | 3 | 0.1% |
| 54.625 | 3 | 0.1% |
| Other values (1791) | 2125 |
| Value | Count | Frequency (%) |
| 0 | 36 | |
| 3.181818182 | 1 | < 0.1% |
| 6.083333333 | 1 | < 0.1% |
| 6.333333333 | 1 | < 0.1% |
| 6.541666667 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 568.5652174 | 1 | |
| 537.25 | 1 | |
| 492.75 | 1 | |
| 464.375 | 1 | |
| 449.75 | 1 |
| Distinct | 998 |
|---|---|
| Distinct (%) | 45.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.074797095 |
|---|---|
| Minimum | -33.33333333 |
| Maximum | 26.20833333 |
| Zeros | 2 |
| Zeros (%) | 0.1% |
| Memory size | 17.2 KiB |
Quantile statistics
| Minimum | -33.33333333 |
|---|---|
| 5-th percentile | -20.29166667 |
| Q1 | -9.75 |
| median | 2.291666667 |
| Q3 | 15.08333333 |
| 95-th percentile | 21.95833333 |
| Maximum | 26.20833333 |
| Range | 59.54166667 |
| Interquartile range (IQR) | 24.83333333 |
Descriptive statistics
| Standard deviation | 13.9572431 |
|---|---|
| Coefficient of variation (CV) | 6.727040027 |
| Kurtosis | -1.218974946 |
| Mean | 2.074797095 |
| Median Absolute Deviation (MAD) | 12.45833333 |
| Skewness | -0.1381714527 |
| Sum | 4545.880435 |
| Variance | 194.8046351 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| -9.75 | 7 | 0.3% |
| 18.95833333 | 7 | 0.3% |
| 16.79166667 | 7 | 0.3% |
| 16.91666667 | 7 | 0.3% |
| 20 | 7 | 0.3% |
| 21.25 | 7 | 0.3% |
| 14.91666667 | 6 | 0.3% |
| 19.83333333 | 6 | 0.3% |
| 22.25 | 6 | 0.3% |
| -7.25 | 6 | 0.3% |
| Other values (988) | 2125 |
| Value | Count | Frequency (%) |
| -33.33333333 | 1 | |
| -31.70833333 | 1 | |
| -27.45833333 | 1 | |
| -27.20833333 | 1 | |
| -26.625 | 1 |
| Value | Count | Frequency (%) |
| 26.20833333 | 2 | |
| 25.625 | 1 | |
| 25.375 | 1 | |
| 25.33333333 | 1 | |
| 25.25 | 1 |
HUMIDITY
Real number (ℝ≥0)
| Distinct | 1296 |
|---|---|
| Distinct (%) | 59.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.38462695 |
|---|---|
| Minimum | 0 |
| Maximum | 100 |
| Zeros | 11 |
| Zeros (%) | 0.5% |
| Memory size | 17.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 21.45833333 |
| Q1 | 37.70833333 |
| median | 55.04166667 |
| Q3 | 70.89583333 |
| 95-th percentile | 86.70710784 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 33.1875 |
Descriptive statistics
| Standard deviation | 20.60985512 |
|---|---|
| Coefficient of variation (CV) | 0.3789647235 |
| Kurtosis | -0.8250369418 |
| Mean | 54.38462695 |
| Median Absolute Deviation (MAD) | 16.5 |
| Skewness | -0.1122573834 |
| Sum | 119156.7177 |
| Variance | 424.766128 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 11 | 0.5% |
| 49 | 6 | 0.3% |
| 58.45833333 | 6 | 0.3% |
| 80.125 | 6 | 0.3% |
| 57.04166667 | 6 | 0.3% |
| 37.58333333 | 5 | 0.2% |
| 40.58333333 | 5 | 0.2% |
| 35.16666667 | 5 | 0.2% |
| 67.04166667 | 5 | 0.2% |
| 54.33333333 | 5 | 0.2% |
| Other values (1286) | 2131 |
| Value | Count | Frequency (%) |
| 0 | 11 | |
| 6.75 | 1 | < 0.1% |
| 8.333333333 | 1 | < 0.1% |
| 8.916666667 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 2 | |
| 99 | 1 | |
| 97.95833333 | 1 | |
| 96.95833333 | 1 | |
| 96.08333333 | 1 |
PREASSURE
Real number (ℝ≥0)
| Distinct | 852 |
|---|---|
| Distinct (%) | 38.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1011.378024 |
|---|---|
| Minimum | 0 |
| Maximum | 1043.458333 |
| Zeros | 11 |
| Zeros (%) | 0.5% |
| Memory size | 17.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1000.4375 |
| Q1 | 1007.875 |
| median | 1016.25 |
| Q3 | 1024.5625 |
| 95-th percentile | 1032.729167 |
| Maximum | 1043.458333 |
| Range | 1043.458333 |
| Interquartile range (IQR) | 16.6875 |
Descriptive statistics
| Standard deviation | 72.5619767 |
|---|---|
| Coefficient of variation (CV) | 0.07174565296 |
| Kurtosis | 187.0859274 |
| Mean | 1011.378024 |
| Median Absolute Deviation (MAD) | 8.375 |
| Skewness | -13.60847839 |
| Sum | 2215929.25 |
| Variance | 5265.240463 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 11 | 0.5% |
| 1020.875 | 8 | 0.4% |
| 1007.833333 | 8 | 0.4% |
| 1014 | 8 | 0.4% |
| 1019.125 | 7 | 0.3% |
| 1027.208333 | 7 | 0.3% |
| 1007.875 | 7 | 0.3% |
| 1002.041667 | 7 | 0.3% |
| 1015.916667 | 7 | 0.3% |
| 1020.666667 | 7 | 0.3% |
| Other values (842) | 2114 |
| Value | Count | Frequency (%) |
| 0 | 11 | |
| 994.0416667 | 1 | < 0.1% |
| 994.4583333 | 2 | 0.1% |
| 994.8333333 | 1 | < 0.1% |
| 994.9583333 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1043.458333 | 1 | |
| 1041.708333 | 1 | |
| 1039.708333 | 1 | |
| 1039.583333 | 1 | |
| 1039.5 | 1 |
| Distinct | 1526 |
|---|---|
| Distinct (%) | 69.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.89117975 |
|---|---|
| Minimum | -4.277322404 |
| Maximum | 34.5204918 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 17.2 KiB |
Quantile statistics
| Minimum | -4.277322404 |
|---|---|
| 5-th percentile | 3.321721311 |
| Q1 | 8.888661202 |
| median | 19.08333333 |
| Q3 | 26.63114754 |
| 95-th percentile | 30.45628415 |
| Maximum | 34.5204918 |
| Range | 38.79781421 |
| Interquartile range (IQR) | 17.74248634 |
Descriptive statistics
| Standard deviation | 9.385066363 |
|---|---|
| Coefficient of variation (CV) | 0.5245638629 |
| Kurtosis | -1.306470141 |
| Mean | 17.89117975 |
| Median Absolute Deviation (MAD) | 8.538251366 |
| Skewness | -0.2101688121 |
| Sum | 39199.57484 |
| Variance | 88.07947064 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 14.60928962 | 7 | 0.3% |
| 27.31420765 | 7 | 0.3% |
| 28.47540984 | 6 | 0.3% |
| 21.84972678 | 5 | 0.2% |
| 27.99726776 | 5 | 0.2% |
| 3.543715847 | 5 | 0.2% |
| 29.09016393 | 5 | 0.2% |
| 28.8510929 | 5 | 0.2% |
| 9.827868852 | 4 | 0.2% |
| 21.06420765 | 4 | 0.2% |
| Other values (1516) | 2138 |
| Value | Count | Frequency (%) |
| -4.277322404 | 1 | |
| -2.706284153 | 1 | |
| -2.672131148 | 1 | |
| -2.603825137 | 1 | |
| -2.023224044 | 1 |
| Value | Count | Frequency (%) |
| 34.5204918 | 1 | |
| 34.11065574 | 1 | |
| 34.11065574 | 1 | |
| 33.56420765 | 2 | |
| 32.98360656 | 1 |
WIND_SPEED
Real number (ℝ≥0)
| Distinct | 2156 |
|---|---|
| Distinct (%) | 98.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.26215979 |
|---|---|
| Minimum | 1.244583333 |
| Maximum | 463.1879167 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 17.2 KiB |
Quantile statistics
| Minimum | 1.244583333 |
|---|---|
| 5-th percentile | 2.501875 |
| Q1 | 5.679583333 |
| median | 10.70875 |
| Q3 | 21.58041667 |
| 95-th percentile | 84.67416667 |
| Maximum | 463.1879167 |
| Range | 461.9433333 |
| Interquartile range (IQR) | 15.90083333 |
Descriptive statistics
| Standard deviation | 41.02850686 |
|---|---|
| Coefficient of variation (CV) | 1.76374452 |
| Kurtosis | 29.34404446 |
| Mean | 23.26215979 |
| Median Absolute Deviation (MAD) | 6.113333333 |
| Skewness | 4.769878438 |
| Sum | 50967.3921 |
| Variance | 1683.338375 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1.9725 | 3 | 0.1% |
| 6.05 | 2 | 0.1% |
| 11.43416667 | 2 | 0.1% |
| 5.384166667 | 2 | 0.1% |
| 11.19666667 | 2 | 0.1% |
| 8.084166667 | 2 | 0.1% |
| 1.934166667 | 2 | 0.1% |
| 6.05375 | 2 | 0.1% |
| 5.60625 | 2 | 0.1% |
| 1.655 | 2 | 0.1% |
| Other values (2146) | 2170 |
| Value | Count | Frequency (%) |
| 1.244583333 | 1 | |
| 1.4125 | 1 | |
| 1.484583333 | 1 | |
| 1.486666667 | 1 | |
| 1.50375 | 1 |
| Value | Count | Frequency (%) |
| 463.1879167 | 1 | |
| 407.3533333 | 1 | |
| 384.4254167 | 1 | |
| 365.43875 | 1 | |
| 365.4116667 | 1 |
| Distinct | 179 |
|---|---|
| Distinct (%) | 8.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 458.2136011 |
|---|---|
| Minimum | 0 |
| Maximum | 999990 |
| Zeros | 1750 |
| Zeros (%) | 79.9% |
| Memory size | 17.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 10.9 |
| Maximum | 999990 |
| Range | 999990 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 21363.56483 |
|---|---|
| Coefficient of variation (CV) | 46.62359385 |
| Kurtosis | 2190.999207 |
| Mean | 458.2136011 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 46.80810625 |
| Sum | 1003946 |
| Variance | 456401902.4 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 1750 | |
| 0.1 | 48 | 2.2% |
| 0.2 | 31 | 1.4% |
| 0.4 | 15 | 0.7% |
| 0.6 | 10 | 0.5% |
| 0.3 | 9 | 0.4% |
| 0.8 | 9 | 0.4% |
| 0.9 | 8 | 0.4% |
| 0.7 | 8 | 0.4% |
| 0.5 | 6 | 0.3% |
| Other values (169) | 297 | 13.6% |
| Value | Count | Frequency (%) |
| 0 | 1750 | |
| 0.1 | 48 | 2.2% |
| 0.2 | 31 | 1.4% |
| 0.3 | 9 | 0.4% |
| 0.4 | 15 | 0.7% |
| Value | Count | Frequency (%) |
| 999990 | 1 | |
| 223 | 1 | |
| 203.6 | 1 | |
| 102.3 | 1 | |
| 75.8 | 1 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| FECHA | SEASON | PM_RETIRO | PM_VALLECAS | PM_CIUDADLINEAL | PM_CENTRO | DEW_POINT | HUMIDITY | PREASSURE | TEMPERATURE | WIND_SPEED | COMMULATIVE_PRECIPITATION | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2010-01-01 | 4 | 0.0 | 0.0 | 0.0 | 129.000000 | -18.750000 | 38.458333 | 1017.083333 | 2.040984 | 14.458333 | 0.0 |
| 1 | 2010-01-02 | 4 | 0.0 | 0.0 | 0.0 | 144.333333 | -8.500000 | 77.937500 | 1024.750000 | 3.372951 | 24.860000 | 0.0 |
| 2 | 2010-01-03 | 4 | 0.0 | 0.0 | 0.0 | 78.375000 | -10.125000 | 87.916667 | 1022.791667 | 0.572404 | 70.937917 | 11.2 |
| 3 | 2010-01-04 | 4 | 0.0 | 0.0 | 0.0 | 29.291667 | -20.875000 | 46.208333 | 1029.291667 | -1.852459 | 111.160833 | 0.0 |
| 4 | 2010-01-05 | 4 | 0.0 | 0.0 | 0.0 | 43.541667 | -24.583333 | 42.041667 | 1033.625000 | -4.277322 | 56.920000 | 0.0 |
| 5 | 2010-01-06 | 4 | 0.0 | 0.0 | 0.0 | 59.375000 | -23.708333 | 39.208333 | 1033.750000 | -2.706284 | 18.511667 | 0.0 |
| 6 | 2010-01-07 | 4 | 0.0 | 0.0 | 0.0 | 72.458333 | -21.250000 | 49.000000 | 1034.083333 | -2.672131 | 10.170000 | 0.0 |
| 7 | 2010-01-08 | 4 | 0.0 | 0.0 | 0.0 | 174.333333 | -17.125000 | 64.541667 | 1028.000000 | -2.023224 | 1.972917 | 0.0 |
| 8 | 2010-01-09 | 4 | 0.0 | 0.0 | 0.0 | 84.750000 | -16.333333 | 57.250000 | 1029.041667 | 0.094262 | 13.298750 | 0.0 |
| 9 | 2010-01-10 | 4 | 0.0 | 0.0 | 0.0 | 55.083333 | -15.958333 | 56.500000 | 1032.500000 | 0.401639 | 17.415833 | 0.0 |
Last rows
| FECHA | SEASON | PM_RETIRO | PM_VALLECAS | PM_CIUDADLINEAL | PM_CENTRO | DEW_POINT | HUMIDITY | PREASSURE | TEMPERATURE | WIND_SPEED | COMMULATIVE_PRECIPITATION | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2181 | 2015-12-22 | 4 | 331.666667 | 351.235294 | 327.666667 | 336.958333 | -4.416667 | 89.166667 | 1027.958333 | 5.217213 | 3.479167 | 0.0 |
| 2182 | 2015-12-23 | 4 | 275.041667 | 257.217391 | 250.125000 | 254.541667 | -5.958333 | 70.458333 | 1026.500000 | 7.266393 | 6.203333 | 0.0 |
| 2183 | 2015-12-24 | 4 | 125.347826 | 119.916667 | 108.727273 | 100.416667 | -6.750000 | 64.208333 | 1027.000000 | 7.334699 | 4.504167 | 0.0 |
| 2184 | 2015-12-25 | 4 | 564.708333 | 518.047619 | 510.043478 | 537.250000 | -4.000000 | 96.083333 | 1019.250000 | 4.704918 | 2.267083 | 0.0 |
| 2185 | 2015-12-26 | 4 | 266.521739 | 255.083333 | 238.208333 | 254.333333 | -5.041667 | 86.583333 | 1024.916667 | 5.114754 | 4.301250 | 0.0 |
| 2186 | 2015-12-27 | 4 | 52.791667 | 66.041667 | 57.708333 | 56.208333 | -13.958333 | 53.541667 | 1038.625000 | 2.928962 | 3.950833 | 0.0 |
| 2187 | 2015-12-28 | 4 | 117.416667 | 119.583333 | 111.833333 | 112.416667 | -11.458333 | 60.750000 | 1035.041667 | 4.056011 | 13.656667 | 0.0 |
| 2188 | 2015-12-29 | 4 | 323.416667 | 361.500000 | 330.750000 | 331.875000 | -6.625000 | 76.125000 | 1028.875000 | 5.285519 | 1.244583 | 0.0 |
| 2189 | 2015-12-30 | 4 | 51.791667 | 135.500000 | 94.291667 | 101.750000 | -8.750000 | 58.458333 | 1030.375000 | 7.300546 | 26.502500 | 0.0 |
| 2190 | 2015-12-31 | 4 | 63.826087 | 83.166667 | 61.304348 | 70.875000 | -10.083333 | 59.416667 | 1032.458333 | 5.251366 | 9.073333 | 0.0 |